Sequences Classi cation by Least General Generalisations

نویسندگان

  • Frédéric Tantini
  • Alain Terlutte
  • Fabien Torre
چکیده

In this paper, we present a general framework for supervised classi cation. This framework provides methods like boosting and only needs the de nition of a generalisation operator called lgg. For sequence classi cation tasks, lgg is a learner that only uses positive examples. We show that grammatical inference has already de ned such learners for automata classes like reversible automata or k-TSS automata. Then we propose a generalisation algorithm for the class of balls of words. Finally, we show through experiments that our method e ciently resolves sequence classi cation tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Convex Hulls in Concept Induction

Classi cation learning is dominated by systems which induce large num bers of small axis orthogonal decision surfaces This strongly biases such systems towards particular hypothesis types but there is reason believe that many domains have underlying concepts which do not involve axis orthog onal surfaces Further the multiplicity of small decision regions mitigates against any holistic appreciat...

متن کامل

Sequences Classification by Least General Generalisations

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...

متن کامل

DNA Sequence Analysis Using Hierarchical ART-based Classification Network

Adaptive resonance theory ART describes a class of arti cial neural net work architectures that act as classi cation tools which self organize work in real time and require no retraining to classify novel sequences We have adapted ART networks to provide support to scientists attempting to catego rize tandem repeat DNA fragments from Onchocerca volvulus In this approach sequences of DNA fragmen...

متن کامل

Cloud Classi cation Using Error-Correcting Output Codes

Novel arti cial intelligence methods are used to classify 16x16 pixel regions (obtained from Advanced Very High Resolution Radiometer (AVHRR) images) in terms of cloud type (e.g., stratus, cumulus, etc.). We previously reported that intelligent feature selection methods, combined with nearest neighbor classi ers, can dramatically improve classi cation accuracy on this task. Our subsequent analy...

متن کامل

Classi cation of Membrane Proteins by Types of Transmembrane Helices Using SOSUI System

There are many amino acid sequences in proteomes which are not homologous to any other sequences. Therefore, methods to classify proteins independent of the sequence homology are strongly required for computational analysis of proteomes. Proteins may be divided into two categories: soluble and membrane proteins. Since membrane proteins are characterized by the existence of long hydrophobic tran...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010